Frequent substructure-based approaches for classifying chemical compounds
Authors
Abstract
Similar resources
Finding Frequent Substructures in Chemical Compounds
The discovery of the relationships between chemical structure and biological function is central to biological science and medicine. In this paper we apply data mining to the problem of predicting chemical carcinogenicity. This toxicology application was launched at IJCAI’97 as a research challenge for artificial intelligence. Our approach to the problem is descriptive rather than based on clas...
Analysing and Classifying Names of Chemical Compounds with CHEMorph
We present a prototypical system with a purely linguistic method to analyse organic chemical compound names. It morpho-semantically analyses compound names, generates line-based, machine-readable representations of their corresponding molecular structures (SMILES strings), and triggers a taxonomic classification. CHEMorph is to be used to support manual database curation and as a basis for bioch...
Binary Substructure Descriptors for Organic Compounds*
Organic chemical structures are represented by binary vectors that contain information about presence or absence of 1365 substructures. The guiding ideas for selecting this set of substructures are described and examples are given. Software SubMat has been developed for a fast and flexible computation of binary substructure descriptors from molecular structures. Examples from structure similari...
Automated Approaches for Classifying Structures
In this paper we study the problem of classifying chemical compound datasets. We present an algorithm that first mines the chemical compound dataset to discover discriminating sub-structures; these discriminating sub-structures are used as features to build a powerful classifier. The advantage of our classification technique is that it requires very little domain knowledge and can easily handle...
Pattern Recognition Approaches for Classifying IP Flows
The assignment of an IP flow to a class, according to the application that generated it, is at the basis of any modern network management platform. However, classification techniques such as the ones based on the analysis of transport layer or application layer information are rapidly becoming ineffective. Moreover, in several network scenarios it is quite unrealistic to assume that all the cla...
Journal
Journal title: IEEE Transactions on Knowledge and Data Engineering
Year: 2005
ISSN: 1041-4347
DOI: 10.1109/tkde.2005.127